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Abstract 



We explore the ability of three-particle templates to distinguish color neutral objects 
from QCD background. This method is particularly useful to identify the standard 
model Higgs, as well as other massive neutral particles. Simple cut-based analysis in 
the overlap distributions of the signal and background is shown to provide a significant 
rejection power. By combining with other discriminating variables, such as planar flow, 
and several variables that depend on the partonic template, three-particle templates 
are used to characterize the influence of gluon emission and color flow in collider events. 
The performance of the method is discussed for the case of a highly boosted Higgs in 
association with a leptonically-decaying W boson. 



1 Introduction 



Processes with top quarks, standard model Higgs and W/Z jets play a key role in high energy 
collisions. They allow for tests of perturbative QCD and are important backgrounds for more 
exotic phenomena. Over the past few years, scenarios have been proposed in which these 
heavy particles are produced at large transverse momentum (i ll • At high enough p?, their 



decay products will appear as heavy, collimated jets 12 13 . Even such exotic final states 
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however, will coexist with a substantial tail of the mass distribution of light-parton QCD 
jets 14,15 . Thus it will generally be necessary to study jet substructure systematically to 



distinguish such a signal. 

Precise predictions for the substructure of such boosted jets are severely limited by the 
complexity of the final state: in general it will be difficult to find an experimental observable 
perfectly correlated with a parton shower history, including color flow and hadronization. 
Nevertheless, a number of methods to analyze high-pr jets have been proposed. Generally, 
these methods depend on differences in the substructure of light-parton QCD jets compared 
to those from particle decays. Diagnostics to detect this difference include infrared safe jet 
shapes 16 20 , direct analyses of jet substructure |3 , 21 34 , grooming methods to improve 
jet mass resolution by reducing jet contamination 35 37 , and matrix element methods |38|. 
Jet substructure techniques have also been studied in the context of specific particle searches, 
where they have been shown to extend substantially the reach of traditional search techniques 
in a wide variety of scenarios 39 -48 . In addition, experiments have started to use boosted 



hadronic objects as a probe of new physics in data 49 51] (for a summary of the experimental 



and theoretical progress in jet substructure, see, for instance, 52 53 



In Ref. 54 a template overlap method was developed for the quantitative comparison of 



the energy flow of observed jets at high-pr with the flow from selected sets (the templates) 
of partonic states. The template overlap method can be summarized as follows, adopting 
the notation of Ref. [54]. We denote a set of particles or calorimeter towers in a physical 
(or event-generated) jet by \j). The jet, of course, is identified by some algorithm. Any 
such jet is to be compared with a large set of configurations |/), which represent sets of 
partonic momenta p±, . . . ,pn that would be identified as a jet by the same algorithm we 
apply to the physical jets. For a given j, we determine the template state for which some 
measure, J-(j,f) = (f\j), is maximized. Although this measure is in principle completely 
free, we choose below a Gaussian in energy differences within angular regions surrounding 
each of the partons of the template. When the energy flow of the state perfectly tracks that 
of a template, the measure is maximized at unity. Any region of partonic phase space {/}, 
can define a template, although in this analysis we will be interested in templates that can 
describe boosted Higgs decays, in terms of the energies and invariant masses of its partons. 

Although applied to boosted Higgs in this paper, the template overlap approach is very 
flexible and can be applied to a wide variety of particle processes for which theoretical models 
have been established. For each set of templates with a definite number of particles, N, the 
overlaps provide us with a tool to match unequivocally arbitrary final states j to partonic 
partners f[j}. Once a "peak template" f[j] is found, we can use it to characterize the energy 
flow of the state, which gives additional information on the likelihood that it is signal or 
background. Similarly, it is possible to analyze the template functions found in this way to 
further discriminate events. We will give examples below. 



As was shown in 54 , the application of these ideas is particularly straightforward for 
top jets. Much of the QCD background is characterized by two sub-jets, with very different 
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energy flow from the three-parton templates in general. Indeed, for a lowest order partonic 
QCD jet consisting of the original parton plus one soft gluon, there is no template state from 
top decay that matches the energy flow. This gives a fundamental discrimination, to which 
we can add additional information from event shapes. 



In practice the procedure outlined in 54 is not yet optimal for disentangling a Higgs, as 
opposed to top, signal from a QCD background, because at lowest order (corresponding to 
templates with N = 2) both boosted Higgs and QCD jets consist of two particles. A success- 
ful identification strategy should also make use of observables that do not map exclusively 
onto the minimum number of partons. We go a further step in this direction by employing 
templates with three partons, with a cut to ensure that they are well-separated in phase 
space. To the extent that physical final states reflect partonic energy flow in both signal and 
background, the population of three-particle templates found from a given sample reflects 
the short-distance dynamics that produced the corresponding physical states. 

In the following section we review the definition of template overlap used in this paper. 
Next, in Sec. 3, we introduce an efficient procedure to construct the templates and maximize 
the overlap. The discriminating power of several observables, including planar flow, and 
several new partonic template variables is discussed in Sec. 4. We then study Higgs tagging 
performance of template overlap in Sec. 5. Finally, we conclude in Sec. 6, leaving a few 
explicit calculations to the appendices. 



2 The Template Overlap Method 



The method developed in 54 makes use of template overlaps that are functionals of energy 



flow of jet events. The templates are sets of partonic momenta / = {pi, . . . ,Pn}i with 

N 



E 



Pi = p, p 2 = m 2 , (i; 



which represent the decay products of a signal of mass M. Here we have adopted the ex- 
pectation that a good, if not the best, rejection power is obtained when we use the signal 
distribution itself to construct our templates (see (55]). For example, the lowest order tem- 
plates for Higgs have N = 2 and for the top decay, N = 3, with phase space in the latter case 
restricted by the W mass: t — > b + W — > b + (qq). The number of particles in the templates 
is not necessarily fixed, and templates with more than the minimum number of particles are 
possible. We will find, however, that combining templates in the full phase space for iV = 3 
and N = 2 already delivers encouraging results for the Higgs. 

In order to compare the energy flow of any given (physical or event-generated) jet to 
that of the signal on the unit sphere, denoted by Q, following the notation of Ref. [54], we 
represent the template energy flow as dE(f) / dfl while that of the jet is denoted by dE(j) / dQ. 
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We may then represent a general overlap functional Ov(j, f) between a jet and a template 



as 



OvUJ) = (j\f)=J 



dE{j) dE(f) 



dtt 



dtt 



(2) 



Here, the functional J 7 is in principle completely free, and by construction it is maximized 
by the corresponding template for a given j. The measure for the matching between j 
and the template / can be taken as the weighted difference of their energy flows integrated 
over some specific region in the phase space. In our analysis, we will take the functional iF 
as a Gaussian in energy differences within angular regions surrounding each parton in the 
template, i.e. around directions of the N template momenta Pi, 



Ov N (j,p u ...,p N ) 



max (k) exp 



JV 



a=l " 



d 2 n 



\{n,n{ f) ) 



E U) 



(3) 



( f) 

where the direction of parton a in the template is h„ and its energy is Eg'. These energies set 
the widths a a of the Gaussians; while the functions 0jv(n, ) restrict the angular integrals 
to (nonintersecting) regions with a definite width surrounding each of the partons of the 
template. The corresponding state will be referred as the "peak template" f[j] for state j. 

A new element of our analysis is to use templates with more than the minimum number 
of particles. This gives us the ability to resolve details of jet substructure, facilitating 
the capture of possible gluon radiation in the heavy particle decay, while still eliminating 
contamination from pile-up and the underlying event. 

A key step for defining template overlap is to choose appropriately the set of template 
states over which the maximization is performed. Ideally, one would determine Ov by max- 
imizing over all possible template states. In practice, however, one needs to introduce a 
discretization in the space of template states. In order to make sure that the maximum 
overlap found by this reduced set is very close to the true maximum, a large number of tem- 
plate states is needed, roughly of order 100 37V_4 . In general, such a maximization procedure 
can be computationally intensive when the number of template particles, N, is large, but in 
section [3] we present an efficient algorithm to perform this task. We emphasize that, once 
generated, the same set of template states is used for all the data. 

In summary, the output of the peak template method for any physical state j is the value 
of the overlap, Ov(j,f), and also the identity of the template state f[j] to which the best 
match is found. As we shall see, this will be of particular value when we apply our method 
to boosted Higgs. 
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3 Analysis overview 



Motivated by Ref. [3], we now describe the application of the template method to the pro- 
duction of a Higgs boson in association with a W boson, p + p — > W + H, followed by the 
dominant light Higgs boson decay, to two b-tagged jets, including schemes for generating 
templates and for discretizing the data. We wish to investigate the tagging efficiency for this 
process, and the fake rates from the background process p + p — > W + jets. We emphasize 
that the method we propose is quite general, and it can be used for other massive object 
searches as well. 

In this section, we describe the construction and use of templates with N = 3 for Higgs 
decay. Three particle templates will allow us to test the influence of gluon emission and color 
flow, through their effect on energy flow. This will provide an additional tool to discriminate 
between QCD and Higgs jets. For example, we expect soft radiation from the boosted color 
singlet Higgs to be concentrated between the b and the b decay products. This is to be 
contrasted to a jet initiated by a light parton, whose color is correlated with particles in 
other parts of phase space, producing radiation in the gaps between those particles and the 
jet system. In this way, the template for a given state can provide evidence on its origin. 
The template method has the advantage of not requiring any special algorithmic technique 
and allows us to consider more elaborate jet substructure observables, and we will look at a 
few specific kinds in the following sections. 



3.1 Matrix Element Optimization 



In the preliminary study of Ref. 54 , the template states, / = p%, ■ ■ ■ ,Pn, were generated 
evenly over all the phase space by a brute force method. This approach was very effective 
for the top at LO, t — > b + W — > b + (qq), where the pair invariant mass was constrained 
to be my/- However, the full, unconstrained 3-particle phase space translates to an increase 
of computational runtime. If we have iV-body templates described by 3N — 4 phase-space 
variables and we divide each of these variables into, say, ~100 bins, then we have ~ 100 3N ~ A 
total bins in the absence of additional kinematic constraints. A coarser discretization might 
generate errors in the estimation of Ov^ij, f), when the matrix element is changing rapidly. 
We here present a new scheme which significantly reduces the number of the template states 
needed for the computation of OvnU, /)• 

A more efficient approach is to generate templates in phase space according to the size 
of squared matrix elements of signal events. This will ensure that most signal events will 
match templates very closely. In regions where matrix elements are small, they are changing 
slowly, so that rare signal events will be well matched even where the density of templates 
is lower. Such matrix element weighting methods are simple to implement and allow one 
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to deemphasize the uninteresting majority of phase spac^] The idea is to associate to each 
template / a weight defined as the probability to produce and observe a template in a given 
model, labelled by the letter a. We can readily evaluate such a probability in the ideal 
situation where the resolution of the detector is perfect. The probability is then given by 
the squared matrix element, summed over final state colors and polarizations and averaged 
over initial ones, 

V(p\a) = — d$\M a (p)\ 2 . (4) 

Here M a is the matrix element, <i$ is the phase-space measure and a a is the total cross 
section. 

The generation of templates according to the cross section in Eq. Q involves three steps: 



• A sampling Monte Carlo routine is used to sample the phase space, {xW};i\ withr 
running over independent kinematical variables, according to the probability distribu- 
tion V(p\a). 

• The same algorithm computes the templates, i.e. the parton momenta {pi}iLi, from 
the set of {x^}^" 4 . 

• The matrix elements contain singularities in certain kinematic configurations. For 
example, those with final states of several jets are singular if the jets are nearly collinear 
or if the energy of a jet approaches zero. Hence, after the momenta are generated, a 
call is made to a routine that is used to apply cuts to the generated templates. A phase 
space point that fails the cut is then rejected and the template is not evaluated, and 
the state reverts to a template with fewer particles. 



3.2 Selection and Discretization of the Data 



We generate events for W + + H bb and W 

with large transverse momentum, using Pythia 8.150 



+ jets — > + jets in a configuration 
Sherpa 1.3.0 [61] (with CKKW 



60 



matching [62]), and Madgraph [63] interfaced to Pythia 6 |64| (with MLM matching [65]). 
Jets are reconstructed using Fastjet 66 , and the anti-fc^ algorithm |67| with large effective 
cone size R = 0.4, 0.7. In this paper, we have chosen plausible values for R, based on 
a combination of physics input and a trial-and-error, but have not attempted to optimize 
them systematically]^] For each event, we find the jet with the highest transverse momentum 
and impose a jet mass window for the Higgs. We choose the jet mass window to be 110 GeV 



*In different contexts, such an approach has been applied to Tevatron data 

^An optimal jet radius would be a compromise between taking it large enough to include perturbative 
final-state radiation, and small enough to avoid too much contamination from the underlying event and 
initial-state radiation. 
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< mj < 130 GeV, with our reference Higgs boson mass chosen to be m# = 120 GeV, and 
jet energy 950 GeV < Po < 1050 GeV. This gives us a set of final states j. 

For any state j, we determine the measured (or Monte Carlo (MC) generated) energy 
distribution, dE(j)/dfl, in the physical 8-<j) plane with respect to the jet axis for each recon- 
structed jet, and we discretize this data into a jet-energy configuration. In our demonstration 
for the Higgs, we discretize the 9-(f> plane into cells of size A9 = 0.04 and A0 = 0.1 ^\ Next, 
we again assemble a table of energies _E(row m , column n ), where row m and column n are the 
row and column number corresponding to the discretized values of 9 and <fi. 

Before proceeding further, let us mention a few words concerning our choice of event se- 
lection criteria and the interpretation of our results. Our focus in this work is to demonstrate 
that the combination of two- and three-particle templates can give a qualitative improve- 
ment in the separation of Higgs signal events from QCD background. In order to simplify 
the analysis we have restricted our study to very large transverse momenta, not realistic at 
this time of the LHC running. However, we regard the findings described below as a proof 
of principle, and we expect that more realistic parameters [69] (for example, a lower cut) 
and b— tagging will provide even higher discriminating power. 



3.3 Construction of template functions 

We define the leading order (next to leading order) templates in terms of the lowest-order 
(next-to-lowest-order) decays of the Higgs, schematically, 

l/) = l^) (LO) = bi,p 2 ), (5) 

and 

\f) = \h}^ = \p uP2 ,p 3 }. (6) 

Our templates will be a set of discretized partonic states corresponding to given points in 
phase space. We wish to generate an ample number of template states to cover both two- 
and three-particle phase space for Higgs decay. To this end, we use a sampling Monte Carlo 
routine to generate points in a region of phase-space of final-state momenta, with a density 



proportional to the differential cross section, as described in Sec. 3.1 above. 



We assume that the lifetime of the Higgs particle is long enough so that its decay is 
incoherent with the evolution of the hard scattering. Thus, the rest frame of the particle is 
identical to the CM frame of its decay products. Starting from the hard parton-level matrix 
elements, we can readily find the distribution of template momenta in the Higgs jet rest 
frame, and then, by a straightforward Lorentz transformation, boost it to the lab frame. 



■^In a typical experimental setup the energy is discretized according to the detector resolution, and each 
pair (row, col) corresponds to a specific cell in the calorimeter. At the LHC experiments |68 , for instance, 
electromagnetic calorimeter cell size (in r\ and <fr) is of 0(0.025 x 0.025) and of 0(0.1 x 0.1) for hadronic 
calorimeter cells. 
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For the two-body Higgs decay, two angles define the two-body state of the daughter 
particles. We choose these as the polar and azimuthal angles in the Higgs rest frame, relative 
to the boost axis that links the Higgs rest frame with the lab frame. In these coordinates, 
the Lorentz invariant differential cross section for the two-body Higgs decay has the simple 
form 

dr i ._. 

— = — d(p a cos v, (7J 

T 4:71 

where T Q is the tree-level decay width for a light Higgs, 



N c G F m H m 2 b f^_Am\ 



3/2 



with m& the 6-quark mass, ms the Higgs mass, Gp the Fermi coupling constant and N c = 3 
the color factor. 

By straightforward Lorentz transformations of particle momenta, the two angles identified 
above determine the energies and directions of the two decay products of the Higgs at LO. 
The two angular parameters 9 and <p are distributed according to (jTl). We generate a large 
set of template states so that we are confident of identifying the peak value of overlap. We 
can now encode the two physical angles in terms of row and column numbers, corresponding 
to the data discretization scheme. Each template consists of the information (row a , column a , 
Eg) for each of the two daughter particles. We exclude those templates having polar angles 
(in the lab frame) larger than the cone size R. 

Next, we consider the templates motivated by the on-shell next-to- leading order decays of 
the Higgs. In this case, more variables enter the problem. They can be chosen as {x\, x 2 , £3}, 
Xi denoting the fractional energies of the (massless) partons in the jet CM frame (normalized 
so that < Xi < 1, Y2 x i — 2), and three Euler angles if), 9, to parameterize the orientation 
of the decay products in the Higgs rest frame (see Appendix A). Since the Higgs boson is a 
scalar, its decay products are spherically symmetric and therefore distributed uniformly in 
if), 4> and cos 9. Specifically, in the rest frame of the scalar Higgs, the b, b quarks and the 
gluon, g have energy fractions given by X\, x 2 and x 3 , respectively, distributed according to 
the differential cross section, 

= 7~^C F a s % — Xl . .f 2 ^ + . 1 dxidx 2 dip d cos 9 d(f>. (9) 

r 87H (l - xi){i - x 2 ) 

Here a s is the coupling constant and C F = (N 2 — 1)/2N. There exists a certain region of 
phase space, corresponding to the soft or collinear emission of a gluon, in which the three- 
parton templates simulates a two-parton template. This is precisely the region in which the 
differential distribution ^ contains divergences, which cancel in the total transition proba- 
bility after the inclusion of virtual gluon exchange diagrams. We regulate these divergences 
in terms of the invariant mass of the pair of nearly-collinear partons, which has a simple 
relation to the variables Xi. For a partonic configuration to correspond to a three-particle 
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template, we require 



va = {Pl + 2 Pj)2 = & + x i - l ) > v, ( 10 ) 



m H 



where y is a cut-off parameter. In the following, we take y = 0.05. 

As for the two-parton templates, we generate a large number of three-parton templates, 
of order one million, distributed according to Q, while satisfying the constraint (10). By 



Lorentz transformations of particle momenta, analogous to the two-particle case, the two 
energy fractions and the three angles identified above determine the energies and directions 
of the three decay products of the Higgs at NLO. As for the discretization of the data, we 
again encode two physical angles in terms of row and column number corresponding to the 
data discretization scheme. A given three-particle template consists of a list (row a , column a , 
E a , a=l,2,3) for each of three daughter particles of NLO Higgs decay (b, b and g). We also 
exclude those templates having particles whose polar angles in the boosted frame, relative 
to the jet axis, are larger than the cone size R. 

Clearly, templates cannot be meaningful for zero momentum partons. However, the 



invariant-mass cut Eq. (10) and the requirement that the templates are within a cone of size 



R imposes a lower cutoff on the energies of the template partons, 

2m 2 H y 



P n R 2 



<E a . 11 



For the benchmark parameters used here, Pq = 1000 GeV, rriH = 120 GeV, y = 0.05, the 
corresponding cutoffs are E a > 3(9) GeV for R = 0.7(0.4). Although the cutoff correspond- 
ing to R = 0.7 happens to be particularly close to the hadronization scale, we checked that 
imposing a more stringent, explicit cut on our templates, namely E & > 20 GeV, did not have 
a sensible effect on the results. 



3.4 Two- and three-particle template overlap 

We are now ready to implement Eq. ^ for the Higgs, by defining an overlap between 
templates, |/), and jet states, Ov = (j\f). Defined as above, our two-parton templates 
each have two cells corresponding to two daughter partons (q and q) with their row and 
column numbers determined by the data discretization scheme. In addition, we have three- 
parton templates each having three cells corresponding to three daughter partons (q, q and 
g) with their row and column numbers determined by the same discretization scheme. 

We compute the overlap between data state j and two- or three-body template / from 
the unweighted sum of the energy in the nine cells of state j surrounding and including the 



9 



occupied cells of template state /, 



Ov N (j, f) = max (r) exp 

N 




ig+1 Jo+1 

E E{k,l)-E{i a ,j a )W 

Z=i a — \ l=j a — l 



(12) 



where iV = 2 or 3. Here, E(i a ,j a pf> is the energy in the template state for particle a whose 
direction is labelled by indices i a and j a , according to the discretization table described in 



Sec. |3.2[ If one of the sums extends outside the jet cone, we set the corresponding energies 
E(k,l) to zero. We fix a a (for the ath parton) by that parton's energy, <j a 



E(i a J a )W/2. 



The use of N = 2 and N = 3 templates follows three steps. Starting from a hard 
massive jet on angular scale R, one identifies the two subjets within it using two-parton 
templates. Within the same angular region, one then further takes the three hardest subjets 



that appear using three-parton templates, if the energies of three subjets satisfy Eq. ( 11 ). If 



this condition is not satisfied, only the two-body templates are used. The whole procedure 
is illustrated in Fig. [TJ 




3-b template 



Figure 1: Stages of the template method described in text. 



3.5 Parton-level studies 

To check the consistency of our method, we first test our ideas using parton-level events, 
which are simulated by a Monte Carlo sampling of Eq. (J9]). The Monte Carlo samples used 
for this validation include final state radiation up to 0(a s ). The events are conveniently 



regularized as in Eq. (10) via an invariant-mass cut algorithm with y = 0.05. The signal 
samples thus contain only q and q quarks and up to one gluon g in the final state. These events 
are not run through full parton showering and hadronization simulations. Consequently, 
the generated partons in each event are distributed according to the same squared matrix 
elements used in the template generation, Eqs. ff) and 
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In Fig. [2j the results are shown for the two- and three-body overlap given by Eq. (12). 
The data shows that, as required for consistency, each peak value Ov 2 (Ov 3 ) is close to unity 
for all the events in our Higgs decaying into a bb (bbg) sample, around 80% (20%) of the 



total (see Fig. 13 in Appendix B). Of the 22% with Ov 3 > 0.8, 14% have small Ov 2 while 



8% have substantial two-body overlap, 0.2 < Ov 2 < 1. As we will see below, the effect of 
showering is to spread out Higgs decays over the full range of Ov 2 and Ov^. 



Ov 2 



78% 2-body-like 



8% 2/3-subjet 
boundary 



14% 3-body-like 



Ov 3 




Figure 2: A scatter plot of two-parton template overlap vs. three-parton template overlap for 
LO parton-level MC output for Higgs decay, with jet energy, P = 1 TeV, uih = 120 GeV. 



3.6 Template overlaps for Higgs and QCD jets 

We now proceed to test the ability of two- and three-parton templates to distinguish between 
energetic Higgs jets from QCD jets. We apply templates to events generated by Pythia |60|, 



MG/ME [63], and Sherpa 61 . The event selection was described in Sec. 3.2 



In Fig. [3] we exhibit typical overlap distributions for showered Higgs and QCD jets (for 
the same jet mass and energy) for event generators Pythia (version 8) for 2—7-2 process 
without matching, MadGraph/MadEvent (MG/ME) (with MLM matching interfaced 
into Pythia V6.4), and Sherpa 1.2.1 (with CKKW matching). In the plots on the left 
panels of Fig. |3j we compare the template overlap distribution from Eq. ( p~2|) with N = 2 
for Higgs and QCD jets. The corresponding plots on the right panels of Fig7[3]show similar 
distributions when three-particle templates (N = 3) are used. Clearly, showering smears 
the Higgs distributions significantly, although Higgs events are concentrated at larger peak 
overlaps than QCD events for both N = 2 and N = 3. These differences, although not 
adequate to isolate a small signal, can serve as a foundation for further enrichment by use 
of additional analysis of template and jet states, as discussed belowj^] 

§Note also the moderate variations between the different Monte Carlo generators. Since the template 
overlap method depends on the precise radiation pattern within a jet, these variations are expected. 
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Figure 3: Comparison of histograms of template overlap Ov with Higgs jets and QCD jets 
from different MCs [from top to bottom, Pythia, Sherpa and Madgraph], for R = 0.7, 
950 GeV< P <1050 GeV, 110 GeV< mj <130 GeV and m H = 120 GeV using 2-body 
templates (left) and 3-body templates (right). 
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4 Single-variable discriminants 



While overlap distributions help enrich samples of Higgs events by capturing the constrast in 
the radiation pattern for the Higgs signal and QCD background, there is still a large tail of 
QCD background events with relatively large Ov. We proceed to develop and study several 
single-variable discriminants, which provide us with additional tools to maximize the sepa- 
ration between QCD and Higgs jets. Many of these variables, although intimately related, 
are worth studying individually, since they can be used to highlight different characteristics 
of jet substructure. 

Infrared-safe observables, such as jet shapes, guarantee that we can make meaningful 
comparisons between theoretical computations and their analogous experimental measure- 
ments. The method identifies the template that a physical state most closely resembles. We 
can also analyze, for example, angular distributions of template partons, considered as sub- 
jets, without the energy weights in event shapes. The template can thus be used to identify 
jet substructure [3 21 38 . The overlap method enables us to make subjet identification by 



providing a mapping between energy-unweighted variables and the template that defines the 
energy flow distributions. In what follows, we describe these two categories of tools that will 
allow us to further reject background events with a large Ov. 



4.1 Planar flow 

Jet shape variables are an especially interesting class of observables for jet studies, and 
have received considerable attention in the past several years in the context of boosted jet 



identification 16 17, 20 . The common feature of all jet shapes is that they involve moments 
of the energy of observed particles and are thus smooth functionals of energy flow within 
a jet. In this manner, they are complementary to the information provided by template 
overlaps, which is associated with jumps and spikes in energy flow. 



Following Ref. 54 , we will make use of the jet shape planar flow in the form, 



where 1^ is defined by, 



*u - — 2^ Ui ' (14) 



with rrij the jet mass, Ui the energy of particle % in the jet, and the k th component 
of its transverse momentum relative to the axis of the jet's momentum. Jets attributed to 
two-body final states have a differential jet function fixed at zero planar flow, 

1 / ^ ^ = 8(Pf). (15) 

2 body 



J KdPf 
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This would apply at leading order for events with highly boosted Higgs and QCD jets. On 
the other hand realistic QCD and Higgs jets have nonzero Pf, because of QCD radiation 
effects that smear the distribution. 

We expect soft radiation from the boosted color singlet Higgs to be concentrated between 
the b and b decay products. This is to be contrasted to a jet initiated by a light parton, 
whose color is correlated with particles in other parts of the event, producing radiation in 
the gaps between those particles and the jet system. Therefore, we expect that planar flow 
for Higgs jets will be peaked toward a lower value than that of QCD jets. In the studies we 
show below, the combination of Ov and Pf gives a strong background (QCD) suppression 
with quite substantial signal (Higgs decay) efficiency 

Higgs 



0.8 
0.6 

0.4 
0.2 



0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 

Figure 4: Density plots of planar flow Pf vs. template overlap Ov for Higgs jets and QCD 
jets from Pythia @, for R = 0.7, 950 GeV< P <1050 GeV, 110 GeV< m.j <130 GeV 
using three-body templates. 




Figure [4] shows the two-dimensional distributions of MC events (obtained via Pythia 
[60 1) in the Pf vs. Ov 3 plane for the signal and background. The scatter plot shows that 
signal events cluster around large overlap while, at the same time, Pf is essentially below 
0.2. By contrast, QCD events tend to be spread over the entire area. These plots also 
confirm our expectation that Higgs jets tend to have smaller Pf values than QCD jet events 
(for the same ratio mj/P ). Clearly, any set of events chosen from the bottom right of 
these plots, with Pf < Ov$, is highly enriched in three-body Higgs events compared with 
background. The clear difference in these scatter plots shows the potential of the template 
overlap method. 
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4.2 Template variables 



We will refer to the second set of variables for discriminating jets as template variables. 
We focus on variables which are sensitive to differences between jet radiation for signal and 
background. For the particular case of the Higgs boson signal, we aim to show the potential 
discriminating power of this category of variables. In future work it should be possible to 
improve upon our naive analysis below, which is based on simple rectangular cuts. 

We can test these ideas on planar flow. The Pf computed from the template of any 
event is a template variable, and can be compared to the corresponding Pf of the event. 
For comparison purposes, in Fig. [5] we show the two-dimensional distributions for the actual 
planar flow and its template variable version for Higgs jets. The data corresponds to Higgs 
jets from parton-level MC output based on Eq. ^ (left) and Pythia (right). The parton 
level plot validates the use of templates, since points are concentrated on the diagonal. The 
distribution from Pythia is slightly different, but the correlation between the actual planar 
flow and its template version is evident. One can see that the effect of showering is to smear 
the Pf distribution to larger values. 

Parton level 



0.8 
0.6 
0.4 

0.2 - 



0.2 0.4 0.6 0.8 

Template Pf 

Figure 5: Density plots of Pf vs. partonic template Pf for Higgs decay events from LO 
parton-level MC output (left) and Pythia (right), with P = 1 TeV, m H = 120 GeV. The 
intensity of the shading is proportional to the density of points. 

We first consider variables constructed out of templates with the minimum number of 
particles. This has been discussed in detail in [54], so our discussion will be brief. As 
described in [Tt] , at lowest order the signal phase space for the Higgs decay is characterized 
by simple kinematic parameters. For example, in Higgs decays, there are only two variables 
that characterize the decay h — > bb. One of them is the jet energy P , and a convenient 
choice for the other variable is the angle, 9 S , between the jet axis and the softer of the two 
particles. At fixed tuj/Pq, Higgs events tend to be peaked towards smaller values of 9 S than 



Showered with Pythia 




Template Pf 
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QCD jets 17 . Therefore, even within the two-body description, 9 S already provides useful 



information of jet energy flow. An analysis of this template variable and its application to 



two-body Higgs decay was presented in 54 



Once we include the contribution from leading order radiation to the Higgs decay, the 



phase space is now characterized by five variables, (xi, £2, 0) as defined in Sec. 3.3 
This gives us more freedom to contrive template variables. In the following, we discuss 
distributions in these variables, beginning with the energy fractions, X\ and x%. Here and 
below, x\ is the energy fraction for the b quark while X2 is that of b in the template. 

We may anticipate that x\ and x<i are not ideal discriminants between Higgs decay and 
QCD events, because both distributions have singularities at x\ — 1 and x 2 — 1, which reflect 
collinear and soft gluon emission. This is seen in Fig. [6j where we show two-dimensional 
distributions for {xi,X2} for Higgs and QCD peak templates. The top panels show the 
{xi,X2} distributions at the Madgraph level before showering, and the bottom panels 
after events are showered and jets reconstructed with Pythia and Fastjet. We have also 
imposed a quality cut on the three-body template overlap Ov 3 > 0.6. We can see that the 
discriminating power of the energy fractions is reduced by requiring an overlap cut in the 
showered events. While one might still be able to improve signal to background by drawing 
a contour around the collinear regions, it is clear that the {xi, 22} variables alone will not be 
very effective in discriminating boosted Higgs jets from QCD background, and in any case, 
in these regions we expect resummation effects to be important. 

Template variables based on angular distributions are more promising than energy frac- 
tions. To be specific, we will consider the following (lab frame) variables, and look at their 
distributions. 



The angles between the jet axis and the template momenta 9ij, 

X{ 77 



l-cos0 w =— (16) 



with z = ttij/Pq. 

The angular separations: # 12 , #13, #23, 



l-cos % =(*i±|=i«. (17) 



The angle between the jet axis and the softest of the partons: 9 S 

zx s mj 



1 — COS#o 



2E 



where E s = min{Ei}. 
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Figure 6: Density plots of X\ vs. x 2 for Higgs signal (blue) and QCD background (red), for the 
LHC Madgraph hard parton-level (top) and showered jet level with Pythia (bottom). 
Both are shown only in the Higgs mass window 110 GeV < mj < 130 GeV, with Pq — 1 TeV, 
m H = 120 GeV. 
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rg = min{0i3/#i2, ^23/^12}, found by finding the minimum of (1 — cos ^3), i = 1,2, 
given by 

. f (1 - x 2 )E 2 (l-x 1 )E 1 \ 



L (l-x 3 )£ 3 [l-x 3 )E 3) 
The three-body angular variable 9, 

d = J2 sindiJ > (20) 



with 9ij given by Eq. (16) 



The expression for the energy E^ of particle % is fairly simple, and is given in Appendix A, 



Eqs. (26 28). 



The distributions of the variables Eqs. (17)-(20) are shown in Fig. [7| All of these variables 
are shown for anti-fey R = 0.7 jets. Since our focus is on the difference in the shapes of various 
observables, all of the kinematic distributions are normalized to unity. The angular variables 
rg and 9 offer the promise of reasonable discriminating power, because they are directly tied 
to physical features of the signal, as follows. 

In the Higgs decay to a quark-antiquark pair and a gluon, h — > qqg, we expect events 
where the gluon is soft to be predominant. In the boosted frame, this radiation appears 
dominantly within an angular region spanned by the dipole formed by the quark and the 



antiquark 70 . In contrast, in the perturbative expansion, jets initiated by quark or gluon 
radiation would have a color connection with the rest of the event resulting in a bias for 
large angle soft gluon emission towards other jets in the event or the beam. One can take 
advantage of this in order to focus on regions in the phase space where Higgs decay events 
are more likely to dominate over QCD events (e.g. with ggg final state). 

The variables rg and 9 are designed to measure the difference in angular ordering in our 
peak templates between the Higgs boson signal and the QCD background. We note that both 
9 and 9\ 2 for three-body templates are somewhat analogous to 9 S for two-body templates. 



5 Higgs tagging performance 



In this section, we investigate the tagging efficiencies for Higgs jets and the mistag rates for 
QCD jets using template overlap. We apply it to events generated by Pythia 8.150 |60|, 
Sherpa 1.3.0 [61] (with CKKW matching (62)), and Madgraph [63l interfaced to Pythia 



6 [64] (with MLM matching [65]) also. The event selection was described in Sec. 3.2 



As seen in Sec. 3J), both two- and thee-body template overlap have substantial discrimi- 
nating power. This can also be seen in the scatter plots, shown in Fig. [81 of Ov 2 and Ov 3 for 
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Figure 7: A selection of various template variable distributions for Higgs jets (blue) and QCD 
background (purple) at the LHC. Events satisfy selection cuts and the Higgs mass window 
cut, 110 GeV < mj < 130 GeV. Horizontal axes are in radians or dimensionless units as 
appropriate, and vertical axes are in arbitrary units with signal and background normalized 
to the same area. 
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Higgs signal (left panel) and dijet production (right panel). While the signal events cluster 
around the upper right corner of the plot, most QCD jet events are localized diagonally 
opposite in the lower left. It follows immediately that making tight cuts on each observable, 
by drawing a rectangular window in the upper right corner of the scatter plot, makes a good 
discriminator to separate signal from background. 
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Figure 8: Density plots of 2-body overlap vs. 3-body overlap for boosted Higgs and QCD jets 
with R — 0.7 and same number of events (20000). 



We now assess the additional discriminating power offered by the template variables 



discussed in section In the simple analysis of this paper, we only look at the effect that 
a simple cut or window has on the number of signal and background events, leaving the use 
of more sophisticated methods for future work. We did not find that any of the template 
variables were qualitatively better than the planar flow cut. On the other hand, we found 
that planar flow and the 9 variable were somewhat complimentary. The two dimensional 
distribution Pf vs 9 for both Higgs jets and QCD jets is shown in Fig. [9] One can see that 
Pf and 9 are approximately independent and, thus, drawing a contour to separate signal 
from background will do better than any single straight line. 



In Fig. 10 , we summarize the boosted Higgs tagging efficiency versus background rejection 
using the two overlaps observables Of 2 and Of 3 and the two variables 9 and Pf. For a given 
lower cut on Of3 (denoted by the same color line in each frame) the efficiency is controlled 
by the lower cut on two-parton overlap Of 2. Each point on one of these curves corresponds 
to a specific choice of Of 2 at fixed Of 3, and hence to the set of points within a rectangle 
that includes the upper right corners of the corresponding scatter plots in Fig. [8] The 
results depend on the choice of Of3 cut, but it is clear that any cut above 0.8 leads to a 
substantial increase in efficiency. Without using properties of the data itself, such as jet 
shapes, it appears that purely template overlap variables can be used to remove significant 
amount of background. Planar flow cuts remove some of the background contributions, but 
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Figure 9: Density plots of Pf vs 9 for anti-fc^ R = 0.7 jets for Higgs signal (left) and QCD 
background (right) at the LHC with selection cuts and quality cut Ov 2 > 0.8 and Ov 3 > 0.8. 



9 distributions also show some remaining discriminating power. Once we combine the fake 
rate and efficiency from a jet mass cut (fake rate: ~ 10%, efficiency: ~ 70%) with template 
overlap, 9 and planar flow, we find, for example, at efficiency of 10%, a fake rate of 0.05% 
(with Ov 3 > 0.8, Pf < 0.2 and 9 < 0.4). 

Our final results for the Higgs jet case are summarized in Tables [T] and [2] for the three 
event generators and R = 0.4,0.7, respectively, that result from including these simple, 
naive one-dimensional cuts in Ot>2, Ov 3 , 9 and Pf at fixed signal efficiency of S = 10%. It is 
evident from the numbers presented that the template overlap method works well for events 
generated by any of the MC generators. In each case, we find a large enhancement of signal 
compared to background, typically of the order of fifteen or more. Taking into account the 
rejection of QCD jets by imposing a mass window, these numbers (for a single massive jet) 
are multiplied by factors of ten to twenty. The efficiency for finding a jet within the Higgs 
mass window is small for R = 0.4, since a small cone is unlikely to capture all of the QCD 
radiation in Higgs decay. For the bigger cone size R = 0.7, the efficiency is increased roughly 
by a factor of 2. 

For comparison, we have applied the BDRS Higgs tagger [3] as implemented by FastJet 



3 66 to the Pythia samples with the same jet radius and invariant mass window for QCD 
jets and Higgs jets. The BDRS Higgs tagger has two distinct parts: First, a mass-drop 
criteria to determine whether a jet contains substructure consistent with a Higgs boson 
decaying into bottom-antibottom quark pairs. If it passes the criteria, a jet is then filtered 
by reclustering it with a smaller jet radius Rfut and only keeping the 3 hardest of these 
subjets as the pieces of the final filtered jet. One can then place cuts on the mass of the 
filtered jets, mfu t , to further reject background from QCD jets. Our results can be seen 



in Fig. 11, which compares the Template Overlap method to the BDRS Higgs tagger in a 
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Figure 10: Comparison of fake rate vs. signal efficiency with various cuts on three-body 
template overlap Ov^ with Higgs jets and QCD jets. The curves represent variations on the 
minimal value of two-body template overlap Of 2. The frames show results from different 
MC [top Pythia, middle Sherpa, and bottom MG/ME with R = 0.4 (left) and R = 
0.7 (right)], for a Higgs mass window selection criteria 110 GeV < m,j < 130 GeV and 
950 GeV < Pq < 1050 GeV. Both efficiency and fake rates decrease as we increase the 
overlap cut. The dashed curves denote the case when Pf cut is implemented, while the solid 
curves have no Pf cut. 
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simple, non-optimized test. We see that the template-based approach and the BDRS Higgs 
tagger perform very similarly. While not an optimal comparison, this should serve as a useful 
point of reference to establish the effectiveness of Template Overlap. 




Figure 11: Efficiency versus mistag rate for the Template Overlap method and the BDRS 
Higgs tagger. The filled curves represent variations on the minimal value of three-body 
template overlap Of 3, while the dashed curve corresponds to a varying mass window for the 
filtered jet mass, uifut. The results are from Pythia 8 with R = 0.7 and same selection 
criteria as in the template case above. 



We present these results for demonstration purposes only, and have not carried out a 
systematic study of how to maximize rejection power. Because the template method natu- 
rally provides scatter plots like those in Figs. [8] and [9j we can imagine optimizing cuts on 
the data. We may also investigate improvements in the overlap functional Eq. (12). 



6 Conclusions 



The template overlap method has the ability to maximize the physics reach for massive jets 
by matching jet energy flow with that of a boosted partonic decay. Template overlap is a 
particularly interesting observable, since it directly measures the N-prong nature of a jet. In 
this paper, we have proposed and tested several modifications to the template overlap method 



as implemented in 54 , combining two- and three-body template states to the analysis of 
boosted Higgs. Using several MC simulated samples, we have shown that template overlap 
offers the promise of a successful boosted Higgs tagger, validating the preliminary study 
in 54 . We have demonstrated how the inclusion of three particle templates allows us to 
test the influence of gluon emission and color flow, through their effect on energy flow, and 
have illustrated its use through the construction of several partonic template observables. 
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MC 


Jet mass 
Higgs-jet efficiency 


cut only 

[%] fake rate [%] 


Mass cut + Ov +9+ Pf 
Higgs-jet efficiency [%} fake rate [%] 


Pythia 8 


60 


5 


10 0.10 


MG/ME 


60 


5 


10 0.10 


Sherpa 


50 


5 


10 0.10 



Table 1: Efficiencies and fake rates for jets with R = 0.4 (using anti-kT'- D = 0.4), 950 
GeV< P <1050 GeV, 110 GeV< mj <130 GeV and m H = 120 GeV. The left pair of 
columns shows efficiencies and fake rates found by imposing the jet mass window only. The 
right pair takes into account the effects of cuts in both Ov's, 9 and Pf in addition to the mass 
window. For the different MC simulations, we have imposed various cuts on Ov, 9 and Pf 
variables: for Pythia v8 |60l Ov 2 > 0.8, Ov 3 > 0.8, 9 < 0.4 and Pf < 0.2, for MG/ME [63] 
interfaced to Pythia v6 |64| ( with MLM matching [65]) Ov 2 > 0.8, Ov 3 > 0.8, 9 < 0.4 and 
Pf < 0.2 and for Sherpa [6l] (with CKKW matching Q) Ov 2 > 0.7, Ov 3 > 0.7, 9 < 0.45 
and Pf < 0.3. 



Different event generators give moderately different averages for our template overlaps. We 
nevertheless find in each case excellent and similar rejection power. For the Higgs case 
studied in this work we get a rejection power of order 1:200 when combined with a jet mass 
cut, with sizable efficiencies. 

We should emphasize that our event selection was chosen in a kinematical regime that 
at present is unrealistic for the LHC. However, our findings should serve as a proof of 



concept for many of the ideas, and, based on ongoing research 69,71 , we expect an extended 
phenomenological analysis to deliver similar qualitative behaviour in terms of rejection power. 
The fact that the rejection powers are strong is encouraging and should help motivate possible 
modifications and improvements of the template method with different analyses in mind. 
We have not yet made use of 6-tagging, which is a natural extension to the method. Since 
template overlap cuts are independent of 6-tagging, we expect the use of 6-tagging to result 



in significant improvements in background rejection 69 . Additional discrimination power 
would be possible using multivariate techniques, and we leave them for future study. Another 
point that we have not investigated in detail and that might be interesting in a future study 
is how it is possible to fix some of the parameters in an optimal way given the properties of 
the events we want to reconstruct. As we have seen in Section [5} the choice of the jet cone 
radius R has a strong impact on the performance of the method; and it might be interesting 



to see how R could be adapted dynamically when varying the jet selection criteria 72 . It 
is also worth noting that the template method is quite general and could be employed in a 
broader set of applications to both Standard Model and beyond the Standard Model physics. 
As the LHC continues to explore the energy frontier of particle physics, template overlap 
provides us with an interesting tool for further development of jet substructure techniques. 
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MC 


Jet mass 
Higgs-jet efficiency 


cut only 

[%] fake rate [%] 


Mass cut + Ov +9+ Pf 
Higgs-jet efficiency [%} fake rate [%] 


Pythia 8 


70 


10 


10 0.05 


MG/ME 


70 


10 


10 0.05 


Sherpa 


60 


10 


10 0.05 



Table 2: Efficiencies and fake rates for jets with R — 0.7 (using anti-kT'- D = 0.7), 950 
GeV< P <1050 GeV, 110 GeV< mj <130 GeV and m H = 120 GeV. The left pair of 
columns shows efficiencies and fake rates found by imposing the jet mass window only. The 
right pair takes into account the effects of cuts in both Ov's, 9 and Pf in addition to the mass 
window. For the different MC simulations, we have imposed various cuts on Ov, 9 and Pf 
variables: for Pythia v8 |60l Ov 2 > 0.8, Ov 3 > 0.8, 9 < 0.4 and Pf < 0.2, for MG/ME [63] 
interfaced to Pythia v6 |64| ( with MLM matching [65]) Ov 2 > 0.8, Ov 3 > 0.8, 9 < 0.4 and 
Pf < 0.2 and for Sherpa |( 



(with CKKW matching 62 ) Ov 2 > 0.7, Ov 3 > 0.7, 9 < 0.45 



and Pf < 0.3. 
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Appendices 



A Kinematic of three-body decays 



We consider the process pp — > HX — > qqgX of Fig. 12, where q and q are quark and 
antiquark and g is a gluon. In the approximation where the Higgs boson is exactly on-shell, 
the cross section for this process can be written in a factorized form as 

dcTpp^Xqqg = dlJpp^XH r ~- (21) 

1 

It follows that the rest frame of the Higgs is identical to the center-of-mass frame of its decay 
products. In that frame the three final state particles will lie in a plane. The phase space 
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Figure 12: A schematic diagram of a Higgs production process and decay. For our study, the 
Higgs boson is assumed to be exactly on-shell. 



can therefore be specified by giving three Euler angles (/0, 0, <p) that specify the orientation 
of the final system relative to the Higgs boson and fractional energy variables 

X i = "To = ~2 ' (22) 
m H /2 m z H 

with E* the CM energy, and 

< x t < 1 . (23) 

Here, g M is the Higgs momentum with q^ = m 2 H , pf are the momenta of the outgoing 
partons (q,q,g)- Energy conservation gives J^Xj = 2, which implies that only two of the Xi 
are independent. 

Let 9ij be the angles between the momenta of the partons % and j. One can relate 
the angles between the momenta to the energy fractions defined by Eq. (22). Momentum 
conservation gives the relation between the energy fractions of the massless decay products 
and the angles between their momenta in the rest frame of the Higgs, 

(l-cos%) = 2(1 7 Xk) , i^j^k. (24) 

X iXj 

We can also obtain the angles between the boosted momenta in any frame, 

2 

(l-cos%) = ^L.(l- Xh ) . (25) 
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The energies Ei after a Lorentz boost 7 = 7n are given by 



Ex = ^{1 + Pn 1 ) 1 m H x 1 , (26) 

1 2 y/S 

E 2 = -(1 + /3ni cos6»i2 - /3n 2 )im H x 2 , (27) 

2 XiX 2 

1 2 v^S* 

^3 = -(1 + /3ni cos 6»i3 + /3n 2 )im H x 3 (28) 

2 X1X3 



and we have defined, 



S = (l-x 1 )(l-x 2 )(l-x 3 ). (29) 
Here, the boost direction is given by a unit vector in terms of two Euler angles, 

n = (rii,n 2 ,n 3 ) = (cos </> sin 6, sin </> sin cos 6). (30) 

In our conventions, the initial z-axis is perpendicular to the plane of the decay in the rest 
frame of the decaying particle, and we chose the x-axis to point in the direction of the quark. 



B Templates at NLO 



As discussed in Sec. |3.1[ templates were generated in three-particle phase space with a 
density that reflects the NLO differential decay rate for Higgs decay, Eq. d9|). To be counted 



as a three-particle template, each pair of partons in the final state should have a fractional 
invariant mass, y^ > y, in Eq. (10), where in this study we have taken y = 0.05. As shown in 
Fig. [2j the output of partonic level Monte Carlos suggests that about 20% of events generated 
in this way will pass this cut. In fact, we can confirm this estimate and calculate NLO y 
dependence analytically, by integrating the differential rate, Eq. ^ subject to yij > y for all 
pairs Following familiar notation [70], this result divided by the total NLO decay rate 
is labelled ^3, the three-jet fraction. The explicit expression is 



C F a s 

4:71 



-4Li 2 2 + 



1 3 
+ 6y log ( — 2 

1 / \V V 



+ \og{y) (-4 log (2y 2 -3y + l) +41og(y) + 3) + (2y - l)(5y - 7) 

+ log(l-2y)(41og(l-y)-3)-6tanh- 1 (l-2y)] , (31) 



We can regard the remaining fraction of events as a two-jet fraction, labelled 

/ 2 = l-/ 3 . 



(32) 



The fractions f 2 and ^3 are shown in Fig. 13 as a function of y. The points correspond to 
the reference value, y = 0.05 used in our study. 
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